Safety-Aware Apprenticeship Learning

نویسندگان

  • Weichao Zhou
  • Wenchao Li
چکیده

Apprenticeship learning (AL) is a class of “learning from demonstrations” techniques where the reward function of a Markov Decision Process (MDP) is unknown to the learning agent and the agent has to derive a good policy by observing an expert’s demonstrations. In this paper, we study the problem of how to make AL algorithms inherently safe while still meeting its learning objective. We consider a setting where the unknown reward function is assumed to be a linear combination of a set of state features, and the safety property is specified in Probabilistic Computation Tree Logic (PCTL). By embedding probabilistic model checking inside AL, we propose a novel counterexample-guided approach that can ensure both safety and performance of the learnt policy. We demonstrate the effectiveness of our approach on several challenging AL scenarios where safety is essential.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of Students’ Perception of Preparedness for Interprofessional learning readiness in apprenticeship and apprenticeship on site in Schools of Nursing and Midwifery of Islamic Azad Universities in Isfahan, Iran in 2018

Background & Objective: Interprofessional education (IPE) is one of the new approaches in the education of students in health-related disciplines. This type of training can increase interprofessional collaborations, thereby improving patient care quality. This study aimed to compare the perception of IPE in students apprenticeship and apprenticeship on site in schools of nursing and midwifery o...

متن کامل

Early Start in Software Coaching

The demand for software coaching and coaches is increasing. As our programming courses are organized according to the Extreme Apprenticeship method, it is relatively safe and straightforward to allow students to participate as coaches in our CS1 course even as early as their second semester. Safety is ensured by the hierarchical structure of CS1 course personnel that provides enough peer and fa...

متن کامل

Situating Learning in the Workplace: Having Another Look at Apprenticeships

This article examines the acquisition of vocational skills through apprenticeship-type situated learning. Findings from a studies of skilled workers revealed that learning processes that were consonant with the apprenticeship model of learning were highly valued as a means of acquiring and maintaining vocational skills. Supported by current research and theorising, this article, describes some ...

متن کامل

Generalizing Apprenticeship Learning across Hypothesis Classes

This paper develops a generalized apprenticeship learning protocol for reinforcementlearning agents with access to a teacher who provides policy traces (transition and reward observations). We characterize sufficient conditions of the underlying models for efficient apprenticeship learning and link this criteria to two established learnability classes (KWIK and Mistake Bound). We then construct...

متن کامل

Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion

We consider apprenticeship learning—learning from expert demonstrations—in the setting of large, complex domains. Past work in apprenticeship learning requires that the expert demonstrate complete trajectories through the domain. However, in many problems even an expert has difficulty controlling the system, which makes this approach infeasible. For example, consider the task of teaching a quad...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1710.07983  شماره 

صفحات  -

تاریخ انتشار 2017